ByteDance Releases Open-Source Multi-modal Model BAGEL: From Image Generation to World Modeling
ByteDance recently released BAGEL (Big Advanced Generalized Embodied Learner), its latest open-source multi-modal foundation model with 7 billion active parameters, marking a new stage for multi-modal AI models. BAGEL performs strongly on key tasks such as image understanding, generation, and editing, and surpasses mainstream open-source vision-language models (VLMs) such as Qwen2.5-VL and InternVL-2.5 on multiple standard benchmarks.